node content
Feature-Attention Graph Convolutional Networks for Noise Resilient Learning
Shi, Min, Tang, Yufei, Zhu, Xingquan, Liu, Jianxun
--Noise and inconsistency commonly exist in real-world information networks, due to inherent error-prone nature of human or user privacy concerns. T o date, tremendous efforts have been made to advance feature learning from networks, including the most recent Graph Convolutional Networks (GCN) or attention GCN, by integrating node content and topology structures. However, all existing methods consider networks as error-free sources and treat feature content in each node as independent and equally important to model node relations. The erroneous node content, combined with sparse features, provide essential challenges for existing methods to be used on real-world noisy networks. In this paper, we propose F A-GCN, a feature-attention graph convolution learning framework, to handle networks with noisy and sparse node content. T o tackle noise and sparse content in each node, F A-GCN first employs a long short-term memory (LSTM) network to learn dense representation for each feature. T o model interactions between neighboring nodes, a feature-attention mechanism is introduced to allow neighboring nodes learn and vary feature importance, with respect to their connections. By using spectral-based graph convolution aggregation process, each node is allowed to concentrate more on the most determining neighborhood features aligned with the corresponding learning task. I NTRODUCTION M ANY real-world applications involve knowledge mining and analysis from network or graph-based data such as citation networks, social networks, telecommunication networks, and biological networks, etc, where data are often collected from noisy channels with erroneous/inconsistent labels or features [1]. In order to carry out pattern mining from networks, such as community detection [2], node classification [3], link prediction [4], etc., network representation learning (or embedding learning) [5] is commonly used to construct features to represent nodes for learning.
Learning Graph Embedding with Adversarial Training Methods
Pan, Shirui, Hu, Ruiqi, Fung, Sai-fu, Long, Guodong, Jiang, Jing, Zhang, Chengqi
Graph embedding aims to transfer a graph into vectors to facilitate subsequent graph analytics tasks like link prediction and graph clustering. Most approaches on graph embedding focus on preserving the graph structure or minimizing the reconstruction errors for graph data. They have mostly overlooked the embedding distribution of the latent codes, which unfortunately may lead to inferior representation in many cases. In this paper, we present a novel adversarially regularized framework for graph embedding. By employing the graph convolutional network as an encoder, our framework embeds the topological information and node content into a vector representation, from which a graph decoder is further built to reconstruct the input graph. The adversarial training principle is applied to enforce our latent codes to match a prior Gaussian or Uniform distribution. Based on this framework, we derive two variants of adversarial models, the adversarially regularized graph autoencoder (ARGA) and its variational version, adversarially regularized variational graph autoencoder (ARVGA), to learn the graph embedding effectively. We also exploit other potential variations of ARGA and ARVGA to get a deeper understanding on our designs. Experimental results compared among twelve algorithms for link prediction and twenty algorithms for graph clustering validate our solutions.
Universal Network Representation for Heterogeneous Information Networks
Hu, Ruiqi, Yu, Celina Ping, Fung, Sai-Fu, Pan, Shirui, Wang, Haishuai, Long, Guodong
Network representation aims to represent the nodes in a network as continuous and compact vectors, and has attracted much attention in recent years due to its ability to capture complex structure relationships inside networks. However, existing network representation methods are commonly designed for homogeneous information networks where all the nodes (entities) of a network are of the same type, e.g., papers in a citation network. In this paper, we propose a universal network representation approach (UNRA), that represents different types of nodes in heterogeneous information networks in a continuous and common vector space. The UNRA is built on our latest mutually updated neural language module, which simultaneously captures inter-relationship among homogeneous nodes and node-content correlation. Relationships between different types of nodes are also assembled and learned in a unified framework. Experiments validate that the UNRA achieves outstanding performance, compared to six other state-of-the-art algorithms, in node representation, node classification, and network visualization. In node classification, the UNRA achieves a 3\% to 132\% performance improvement in terms of accuracy.
Joint Identification of Network Communities and Semantics via Integrative Modeling of Network Topologies and Node Contents
He, Dongxiao (Tianjin University) | Feng, Zhiyong ( Tianjin University ) | Jin, Di (Tianjin University) | Wang, Xiaobao (Tianjin University) | Zhang, Weixiong (Washington University in St. Louis)
The objective of discovering network communities, an essential step in complex systems analysis, is two-fold: identification of functional modules and their semantics at the same time. However, most existing community-finding methods have focused on finding communities using network topologies, and the problem of extracting module semantics has not been well studied and node contents, which often contain semantic information of nodes and networks, have not been fully utilized. We considered the problem of identifying network communities and module semantics at the same time. We introduced a novel generative model with two closely correlated parts, one for communities and the other for semantics. We developed a co-learning strategy to jointly train the two parts of the model by combining a nested EM algorithm and belief propagation. By extracting the latent correlation between the two parts, our new method is not only robust for finding communities and semantics, but also able to provide more than one semantic explanation to a community. We evaluated the new method on artificial benchmarks and analyzed the semantic interpretability by a case study. We compared the new method with eight state-of-the-art methods on ten real-world networks, showing its superior performance over the existing methods.